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SYSTEM AND METHOD FOR IMPLEMENTING A SKEW-TOLERANT 
TRUE -SINGLE -PHASE -CLOCKING 
FLIP-FLOP 



BACKGROUND OF THE PRESENT INVENTION 
Field of the Invention 

The present invention relates generally to systems 
and methods for designing synchronous digital circuits, 
and specifically to designing a skew-tolerant true- 
single-phase clocking flip-flop. 
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Background and Objects of the Present Invention 

Clock signals are used in synchronous digital 
circuits to allow different parts of the circuit to 
communicate with each other without data loss. Values 
5 carried by data signals are defined at specific times 

relative to the transitions of the clock signals, and are 
read into and out of storage elements at times related to 
these same transitions. The clock signals are typically 

a! distributed to all storage elements in the design. The 

W 

Q 10 inevitable signal delay caused by the distribution 

in 

ifl network must be taken into account during its design, 

xp such that the clock signal arrives at all storage 

□ elements at substantially the same time. 

m 

O The required degree of simultaneity may be 

in 

*0 15 quantified with the setup time and hold time for the 

clocked devices, such as flip-flops. In order to meet 
the required setup time for a flip-flop device, input 
data must be present at the data input lead of the flip- 
flop device and in stable form for a predetermined amount 
20 of time before the clock transition. In order to meet 

the hold time requirement for the flip-flop device, the 
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data must be stable from the time of the clock transition 
on arrival at the control lead of the flip-flop up to a 
certain time interval after the arrival of the clock. 

Many setup time violations may be remedied simply by 
5 slowing down the frequency at which the design is 

clocked. However, hold time violations may be caused by 
clock skew, and can persist regardless of the clock 
frequency. Insufficient control of the clock delay in 
different parts of the clock distribution network may 

10 cause the clock edges to arrive at the clocked devices at 

different times. These time differences constitute the 
clock skew, which, if large enough to cause hold 
violations (or setup violations if the clock frequency is 
high enough), can cause circuit malfunction. For 

15 example, the clock skew may cause data in a first 
register to shift earlier than data on a second register. 
The hold time requirement of the second register may, 
therefore, be violated, and data bits may then be lost . 
A dominant clocking strategy presently used for 

20 digital circuits is single-phase clocking with true- 

single-phase-clocking (TSPC) flip-flops . Single-phase 
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clocking systems use only one clock signal, and 
therefore, only one clock distribution network, which 
avoids the task of controlling clock skew among several 
networks. The only clock skew present in a single-phase- 
5 clocking system is that created among several instances 

of the same signal which have suffered different amounts 
of delay through different paths in the distribution 
network. Therefore, the simplified clock skew management 
task in single-phase-clocking systems allows for high 

10 clock frequencies compared with previously utilized 

clocking techniques. These benefits often outweigh the 
drawbacks, one of which is a stringent requirement on the 
clock transition speed. 

Current schemes for distributing clock signals to 

15 storage elements concentrate on ensuring synchronicity of 

all clock signals. Clocks are typically distributed in 
a tree-like structure, whereby delays in different 
branches can be balanced to a high degree. The highly 
balanced clock trees traditionally used with single-phase 

20 clocking cause all TSPC flip-flops in the design to 

toggle virtually simultaneously. The capacitive loads 
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driven by the flip-flop outputs are then charged 
simultaneously, drawing a large current spike from the 
supply. 

Such current spikes are undesirable due to the 
5 resulting metal migration in supply wires. The rate of 

migration depends strongly upon the maximum current 
density that occurs in the wire. ' Large current spikes 
thus require wider supply wires with the concomitant cost 
in area. In addition, large current spikes feature large 

10 values of dl/dt. Together with the parasitic inductance 

present in the IC package, the current spikes thus cause 
voltage fluctuations on the supply lines. These 
fluctuations can cause both malfunction of the digital 
circuits and reduced performance level of co-located 

15 analog circuity. Although the aforementioned problems 

may be corrected by advanced packaging and on-chip 
decoupling capacitance, both of these methods increase 
the cost of the device. Furthermore, the large current 
spikes themselves can couple inductively into other parts 

20 of the design and cause malfunction or performance 

reduction. The aforementioned electrical problems can 
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potentially be alleviated by introducing a controlled 
amount of clock skew, such that not all storage elements 
change value at the same time. However, such 

purposefully added skew could create logical malfunctions 
5 due to hold violations, as described above. 

It is, therefore, an object of the present invention 
to provide a skew-tolerant TSPC flip-flop that reduces 
the overall current spike. 

10 SUMMARY OF THE INVENTION 

A preferred embodiment of the present invention is 
directed to a skew-tolerant TSPC flip-flop that reduces 
the overall current spike by allowing willful 
introduction of skew in the clock tree of a single-phase 

15 design. More precisely, a split-clock TSPC flip-flop, 

which allows the flip-flop hold times to be met even with 
skewed clocks, can be substituted for a traditional TSPC 
flip-flop in a sequential logic circuit. By introducing 
clock skew, different flip-flops toggle at slightly 

20 different times, such that the corresponding current 

spikes are slightly staggered in time. Therefore, the 
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clock skew can serve to "smear out" the overall current 
spike, which reduces the maximum value of the current 
spike as well as the maximum value of dX/dt. The input 
of the split-clock TSPC flip-flop can be latched 
5 according to a first clock signal, which was used in a 

preceding stage, while the output of the split-clock TSPC 
flip-flop can be driven according to a second clock 
signal. The first and second clock signals are skewed in 

P 

time, but have the same frequency and substantially the 

lr 2 j 

□ 10 same phase. In one embodiment, an additional Metal Oxide 

U = 

*T* Semiconductor (MOS) device can be included within the 

m 

;5 split-clock TSPC flip-flop to reduce power dissipation in 

□ cases of large clock skew. 

m 
q 
m 

*0 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The disclosed invention will be described with 
reference to the accompanying drawings, which show 
important sample embodiments of the invention and which 
5 are incorporated in the specification hereof by 

reference, wherein: 

FIGURE 1 is a schematic diagram of one pipeline 
stage in a digital sequential logic circuit; 

FIGURE 2 is a timing diagram illustrating proper 
10 setup and hold times associated with the circuit shown in 

FIGURE 1 of the drawings; 

FIGURE 3 is a schematic diagram of one pipeline 
stage in a digital sequential logic circuit with skewed 
clocks; 

15 FIGURE 4 is a timing diagram illustrating proper 

setup and hold times associated with the circuit shown in 
FIGURE 3 of the drawings; 

FIGURE 5 illustrates a split-clock, skew-tolerant 
true-single-phase-clocking (TSPC) flip-flop in accordance 

20 with embodiments of the present invention; 
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FIGURE 6 is a schematic diagram of one pipeline 
stage in a digital sequential logic circuit with a split- 
clock TSPC flip-flop in accordance with embodiments of 
the present invention; 
5 FIGURE 7 is a timing diagram illustrating proper 

setup and hold times associated with the circuit shown in 
FIGURE 6 of the drawings; and 

FIGURE 8 illustrates a split-clock TSPC flip-flop 

O 

*0 having an anti-short-circuit-current device in accordance 

y 

□ 10 with embodiments of the present invention. 

m 
m 
m 

ti 

0 

in 
o 

s 

^0 
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DETAILED DESCRIPTION OF THE PRESENTLY PREFERRED EXEMPLARY 
EMBODIMENTS 



The numerous innovative teachings of the present 
5 application will be described with particular reference 

to the presently preferred exemplary embodiments. 
However, it should be understood that this class of 
embodiments provides only a few examples of the many 
advantageous uses of the innovative teachings herein. In 

O 

10 general, statements made in the specification of the 

•■555 

y present application do not necessarily delimit any of the 

various claimed inventions. Moreover, some statements 
may apply to some inventive features but not to others. 
O With reference now to FIGURE 1 of the drawings, a 

in 

*3 15 sequential logic circuit is shown having a first D type 

flip-flop (D-FF) 10, which has a data input D for 

'B7 

receiving an input signal "in," a clock input 18 for 
receiving a clock signal $ and a data output Q for 
transmitting an output signal "out." Data is clocked 
20 into the D-FF 10 in accordance with well-known methods on 

the rising or falling edge of a clock signal * generated 
by a clock 20 along a line 12 leading to the clock input 



m 
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18, Data propagates through combinational logic 14 
connected to the D-FF 10 over line "a" in accordance with 
conventional methods, and the output of the combinational 
logic 14 is clocked into a second D-FF 16 over line "b." 
5 The timing diagram of FIGURE 2 shows that in order 

for the circuit of FIGURE 1 to operate properly, data 
must be stable on the input D of the D-FF 16 relative to 
the arrival of the clock signal * at the clock input 18 
for a specified period of time. As shown in FIGURE 2, 

10 for any clocked device, there is a specified "setup time" 

S and a specified "hold time" H. Input data must be 
present and stable from at least the setup time S before 
the clock transition until a period equal to at least the 
hold time H after the clock transition, for proper 

15 operation. 

For example, in FIGURE 2, Dl is the intrinsic delay 
in the first D-FF 10, while D2 is the delay through the 
combinational logic 14. It should be noted that D2 can 
take on a range of values depending upon the function 

20 performed by the combinational logic 14. In addition, S, 

H, and Dl are data-dependent, but can be considered 
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constant in FIGURE 2. In order to prevent a setup 
violation, it is required that the clock period T satisfy 
the following equation: 

5 T > Dl + max(D2) + S (1) 

The maximum delay of the combinational logic 14 typically 
dominates Dl and S. In addition, in order to prevent a 
hold violation, the hold time H must satisfy the 
10 following equation: 

H < Dl + min(D2) (2) 

As min(D2) can be very small, it is frequently more 
15 difficult to guarantee the hold condition than the setup 

condition, especially when T is not very small. 

With reference now to FIGURE 3 of the drawings, the 

same circuit as in FIGURE 1 is shown, but with skewed 

clocks $ a and 3> b being transmitted over lines 12a and 12b, 
20 respectively. As shown in the timing diagram of FIGURE 

4, which corresponds with the circuit shown in FIGURE 3, 
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if the clock skew "d" is positive, the setup requirement 
becomes more difficult to satisfy and the hold 
requirement becomes easier to satisfy. However, the 
opposite holds true if d is negative. For example, the 
5 setup requirement is now: 

T - d > Dl + max (D2) + S. (3) 

This setup requirement is stricter if d is positive. 
10 However, since max(D2) is likely to be much larger than 

d, it is still possible to meet the requirement by 
adjusting max(D2) by modifying the implementation of the 
combinational logic block. In addition, the hold 
requirement is now: 

15 

H < Dl + min(D2) + d. (4) 

This hold requirement is stricter if d is negative. 
Furthermore, if the magnitude of d is sufficiently large, 
20 it will be virtually impossible to meet the hold 

requirement. The clock skew can usually be held low 
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enough in conventional single-phase logic design, but the 
amount of extra clock skew which can be introduced to 
address the problem of current spike is necessarily 
limited. 

5 Therefore, in accordance with aspects of the present 

invention, in order to reduce the current spike seen in 
typical single-phase circuit designs, clock skew is 
purposefully introduced without causing a hold violation 
by utilizing a modified true-single-phase-clocking (TSPC) 

10 flip-flop as the second D-FF 16. The introduction of 

clock skew causes different flip-flops 10 and 16 to 
toggle at slightly different times, such that the 
corresponding current spikes are slightly staggered in 
time. The clock skew can therefore serve to "smear out" 

15 the overall current spike, reducing its maximum value as 

well as the maximum value of dl/dt. Furthermore, this 
willful introduction of clock skew does not prevent the 
second D-FF 16 hold time from being met. 

Thus, with reference now to FIGURE 5 of the 

20 drawings, in accordance with embodiments of the present 

invention, one type of a nine-transistor M1-M9 split- 
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clock TSPC flip-flop 16 is shown, which can be 
substituted for the second D-FF 16 shown in FIGURES 1 and 
3 of the drawings. In operation, at the rising edge of 
the first clock signal O a of the split clock TSPC flip- 
5 flop 16, the input signal "in" at the input D is isolated 

from the rest of the circuit 16 as two p-type field 
effect transistors (pFETs) Ml and M4 turn off. Node B 
cannot be pulled up since M4 is off, and node A cannot be 

O 

*A charged up since Ml is off. Therefore, node B cannot be 

W 

O 10 discharged as a consequence of a falling input. 

Ill 

W 1 At the rising edge of the second clock signal 4> b , 

m 

>P which has the same frequency and substantially the same 

13 phase as the first clock signal * a , but is preferably 

|3 slightly skewed in arrival time from the first clock 



^ 15 signal 4> a , the previous data stored in the flip-flop 16 

is driven to the output Q. This is accomplished by n- 
type field effect transistors (nFETs) M9 and M6 turning 
on, which allows the output Q to go low if node B is 
high, and alternatively allows node B to go low and the 
20 output Q to go high if node A is high. Therefore, at 

the rising edge of the first clock 4> a , the pFETs Ml and 
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M4 isolate the input signal "in," while at the rising 
edge of the second clock <D b , the nFETs M9 and M6 drive the 
stored data to the output Q to be transmitted by the 
flip-flop 16 as output signal "out." By having the pFETs 
5 Ml and M4 and the nFETs M6 and M9 controlled by two 

separate clock signals <£> a and 3> b , respectively, which are 
slightly skewed, the overall current spike is reduced 
without violating TSPC flip-flop 16 hold times. 

Q 

^2 The use of separate clocks $ a and $ b for input and 

W 

B 10 output control, respectively, does not correspond to the 

is- 

:r - 

two-phase clock designs that predated the single-phase 

IS 

s P paradigm. Two-phase logic styles with good skew 

O tolerance have been described, for example, in the 

III 

□ following article by N.F. Goncalves and H. De Man: NORA: 

111 

5 0 15 a Racefree Dynamic CMOS Technique for Pipelined Logic 

*D 

Structures; IEEE JSSC, Vol. SC-18, No. 3, pp. 261-66; 
June 1983. However, the two-phase clock designs 
described in the Goncalves article were not designed to 
handle the case when the two clock phases were identical 
20 or nearly identical. In contrast, the split clock flip- 
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flop 16 described herein handles both small positive and 
negative skews, as well as "zero" skew. 

With reference now to FIGURES 6 and 7 of the 
drawings, in order to use the split clock flip-flop 16 of 
5 the present invention in a pipeline stage of a digital 

circuit, the pFETs (Ml and M4 shown in FIGURE 5) of the 
second TSPC flip-flop 16 are controlled by the same clock 
signal $ a as that used for the previous pipeline stage 
(TSPC flip-flop 10) . Thus, the second TSPC flip-flop 16 

10 has its output Q driven according to the potentially 

delayed clock 4> b , which is transmitted over line 12b to 
clock input 18b, but its input D latched according to the 
potentially earlier clock 4> a , which is transmitted over 
line 12a to clock input 18a. It should be noted that the 

15 first TSPC flip-flop 10 can also be a split-clock flip- 

flop, as long as the output Q of the first TSPC flip-flop 
10 is controlled by the same clock <t> a as the input D of 
the second TSPC flip-flop 16. 

However, any significant skew between the two clocks 

20 shown in FIGURES 5-7 has an undesired side effect in the 

second TSPC flip-flop 16. If there is significant skew, 
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the input clock $ a of the second TSPC flip-flop 16 may 
fall before the output clock 3> b of the second TSPC flip- 
flop 16, and M6 and M4 will both be on simultaneously for 
a short interval. If node A is high, short-circuit 
5 current will be drawn through M4, M5 and M6. 

Thus, with reference now to FIGURE 8 of the 
drawings, an additional nFET M10, which is controlled by 
the input clock 3> a , can be introduced in series with M4, 
M5, and M6. The additional nFET M10 serves to turn off 

10 the short-circuit path through M4, M5 and M6 when the 

input clock 3> a rises, which reduces the power dissipation 
in cases of large clock skew. Although the additional 
nFET M10 is shown between M4 and M5, it should be noted 
that the best position of the nFET M10 with respect to 

15 M4, M5 and M6 depends upon specific detailed circuit 
parameters. The defining aspect is that the additional 
nFET mlO is controlled by the input clock O a and that it 
serves to turn off the short-circuit path. 

Furthermore, it should be noted that the present 

20 invention is not limited to the particular type of TSPC 

flip-flop 16 discussed herein. Instead, the present 
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invention can be applied whenever two classes of clocked 
devices, which handle the timing on the input side and on 
the output side of the flip-flop 16, respectively, are 
identifiable. For example, the input stage of the flip- 
5 flop 16 could be controlled by nFETs, while the output 

stage could be controlled by pFETs. 

As will be recognized by those skilled in the art, 
the innovative concepts described in the present 
application can be modified and varied over a wide range 
10 of applications. Accordingly, the scope of patented 

subject matter should not be limited to any of the 
specific exemplary teachings discussed, but is instead 
defined by the following claims. 
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